How to select the largest k elements from evolving data?

نویسندگان

  • Qin Huang
  • Xingwu Liu
  • Xiaoming Sun
  • Jialin Zhang
چکیده

In this paper we investigate the top-k-selection problem, i.e. determine the largest, second largest, ..., and the k-th largest elements, in the dynamic data model. In this model the order of elements evolves dynamically over time. In each time step the algorithm can only probe the changes of data by comparing a pair of elements. Previously only two special cases were studied [2]: finding the largest element and the median; and sorting all elements. This paper systematically deals with k ∈ [n] and solves the problem almost completely. Specifically, we identify a critical point k∗ such that the top-k-selection problem can be solved error-free with probability 1 − o(1) if and only if k = o(k∗). A lower bound of the error when k = Ω(k∗) is also determined, which actually is tight under some condition. On the other hand, it is shown that the top-k-set problem, which means finding the largest k elements without sorting them, can be solved error-free for all k ∈ [n]. Additionally, we extend the dynamic data model and show that most of these results still hold.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Note on Evolutionary Rate Estimation in Bayesian Evolutionary Analysis: Focus on Pathogens

Bayesian evolutionary analysis provide a statistically sound and flexible framework for estimation of evolutionary parameters. In this method, posterior estimates of evolutionary rate (μ) are derived by combining evolutionary information in the data with researcher’s prior knowledge about the true value of μ. Nucleotide sequence samples of fast evolving pathogens that are taken at d...

متن کامل

Modified Convex Data Clustering Algorithm Based on Alternating Direction Method of Multipliers

Knowing the fact that the main weakness of the most standard methods including k-means and hierarchical data clustering is their sensitivity to initialization and trapping to local minima, this paper proposes a modification of convex data clustering  in which there is no need to  be peculiar about how to select initial values. Due to properly converting the task of optimization to an equivalent...

متن کامل

Evolution of Character in the Exegesis Attitude to Theosophical Interpretations of Quran; the Story of Ibrahim in Focus

Story is one of the best and most accessible ways which Qur'an uses for conveying the concepts and explaining its religious teachings. Quranic stories are considered as narrations a large proportion of which is devoted to the lives of the prophets. As character is one of the main elements of a narrative and also, since after Moses, the prophet Ibrahim is the most frequent character in Quranic s...

متن کامل

A Characterization of the Suzuki Groups by Order and the Largest Elements Order

One of the important problems in group theory is characterization of a group by a given property, that is, to prove there exist only one group with a given property. Let  be a finite group. We denote by  the largest order of elements of . In this paper, we prove that some Suzuki groups are characterizable by order and the largest order of elements. In fact, we prove that if  is a group with  an...

متن کامل

سنگ‌شناسی و ژئوشیمی گدازه‌های‌بالشی و سنگ‌های‌مافیک ، حدواسط و فلسیک وابسته به آن در دنباله‌ی افیولیتی صحنه- هرسین (شمال‌شرق کرمانشاه)

The ophiolitic complex of Sahneh-Harsin is composed of rock units such as gabbro, basalt, diabasic dikes, diorite and plagiogranite. The geochemical studies show that primitive magma has been sub-alkaline tholeiitic Mg-rich and K-poor. These data show enrichment of light rare earth elements (LREE) and depletion of heavy rare earth (HREE). The ratio of NbTa in the basalts is between 16.19 and 18...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1412.8164  شماره 

صفحات  -

تاریخ انتشار 2014